Computing tips for awesome research and an easy life

Robert Turner, University of Sheffield RSE Team September, 2021

Acknowledgements

Contains elements from Reproducible Research Data and Project Management in R, by Anna Krystalli and from Methods in Research Software Engineering by David Wilby.

About me

Bob Turner

Mix of software engineering and research experience.

RSE Team

RSE

13 RSEs, 35 projects / year worth ~£11m total

This presentation is flawed

Focusses on what to do, not how to do it.

In this session…

Discussion

What are the characteristics of well engineered research software?

Link to interactive doc

“Good” research software

  • Version control
  • Automated tests
  • Controlled execution environment
  • Documentation
  • Parameterised

https://joss.readthedocs.io/en/latest/review_checklist.html

Data Management

Data Management Plan

  • Start early. Make an RDM plan before collecting data.
  • Anticipate data products as part of your thesis outputs.
  • Think about what technologies to use.

Own your data

Take initiative & responsibility. Think long term.

Spreadsheets?

Do you agree?

Excel

But good for data viewing / entry, sometimes, perhaps…

Databases

Have a look at the Data Carpentry SQL for Ecology lesson

Data formats

  • .csv: comma separated values.
  • .tsv: tab separated values.
  • .txt: no formatting specified.

more unusual formats will need instructions on use.

Ensure data is machine readable

Robot reading

bad

bad

good

ok

  • could help data entry
  • .csv or .tsv copy would need to be saved.